DL_POLY_4 and Xeon Phi: Lessons Learnt

Authors

  • Alin Marin Elena
  • Christian Lalanne
  • Victor Gamayunov
  • Gilles Civario
  • Michael Lysaght
  • Ilian Todorov
Abstract

Molecular dynamics techniques have grown rapidly over the last twenty years. This growth was fuelled by the development of new scalable mathematical algorithms, the availability of powerful hardware and the wider availability of ready-to-use software packages. DL_POLY is one of these packages, widely adopted by the computational physics and materials science communities. DL_POLY started life in 1992 at Daresbury Laboratory, now part of the Science & Technology Facilities Council in the United Kingdom, with a first public release in 1993. The main developers of the current version are W Smith and IT Todorov. DL_POLY is a general classical molecular dynamics code and has been used to simulate macromolecules (both biological and synthetic), complex fluids, materials and ionic liquids. DL_POLY also plays an important role as a sandbox, both for the development of new methods and algorithms for molecular dynamics and for the testing of emerging hardware technologies [1], [2]. The core code is written to the Fortran 95/2003 standards and optimised for distributed systems using domain decomposition; OpenMP and CUDA ports also exist as contributions to DL_POLY but are not part of the official distribution. DL_POLY is free to use for academics pursuing non-commercial research and is available under licence to all other users. The Intel Xeon Phi co-processor is a novel accelerator technology that offers several attractive features: many cores (60 cores with 240 hardware threads for the mid-range model), low power consumption, the same set of instructions as an Intel CPU, support for popular, standardised programming models such as MPI and OpenMP, and a theoretical peak of 1 TFlops in double precision. In this communication we present the progress made in porting DL_POLY to the Xeon Phi co-processor and optimising it for that platform.
The rest of the paper is organised as follows: a short introduction to the methodology used for porting and optimisation, followed by the OpenMP implementation results in section III-A, the synchronous offload results in section III-B and the MPI symmetric running mode in section III-C.

Similar resources

Porting FEASTFLOW to the Intel Xeon Phi: Lessons Learned

In this paper we report our experiences in porting the FEASTFLOW software infrastructure to the Intel Xeon Phi coprocessor. Our efforts involved both the evaluation of programming models including OpenCL, POSIX threads and OpenMP and typical optimization strategies like parallelization and vectorization. Since the straightforward porting process of the already existing OpenCL version of the cod...

Performance Analysis of an Astrophysical Simulation Code on the Intel Xeon Phi Architecture

We have developed the astrophysical simulation code XFLAT to study neutrino oscillations in supernovae. XFLAT is designed to utilize multiple levels of parallelism through MPI, OpenMP, and SIMD instructions (vectorization). It can run on both CPU and Xeon Phi co-processors based on the Intel Many Integrated Core Architecture (MIC). We analyze the performance of XFLAT on configurations with CPU ...

Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach

Modern processor technologies have driven new designs and implementations in main-memory hash joins. Recently, Intel Many Integrated Core (MIC) co-processors (commonly known as Xeon Phi) embrace emerging x86 single-chip many-core techniques. Compared with contemporary multi-core CPUs, Xeon Phi has quite different architectural features: wider SIMD instructions, many cores and hardware contexts, ...

Accelerating DNA Sequence Analysis using Intel Xeon Phi

Genetic information is increasing exponentially, doubling every 18 months. Analyzing this information within a reasonable amount of time requires parallel computing resources. While considerable research has addressed DNA analysis using GPUs, so far not much attention has been paid to the Intel Xeon Phi coprocessor. In this paper we present an algorithm for large-scale DNA analysis that exploit...

Fast computation of computer-generated hologram using Xeon Phi coprocessor

We report fast computation of computer-generated holograms (CGHs) using Xeon Phi coprocessors, which have massively x86-based processors on one chip, recently released by Intel. CGHs can generate arbitrary light wavefronts, and therefore, are promising technology for many applications: for example, three-dimensional displays, diffractive optical elements, and the generation of arbitrary beams. ...

Publication date: 2016